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Description 

The present invention relates to fusion polypeptides where two individual polypeptides or parts thereof are 
fused to form a single amino acid chain. Such fusion may arise from the expression of a single continuous cod- 

5 ing sequence formed by recombinant DNA techniques. 

Fusion polypeptides are known, for example those where a polypeptide which is the ultimately desired 
product of the process is expressed with an N-terminal leader sequence" which encourages or allows secre- 
tion of the polypeptide from the cell. An example is disclosed in EP-A-116 201 (Chiron). 

Human serum albumin (HSA) is a known protein found in the blood. EP-A-147 198 (Delta Biotechnology) 

10 discloses its expression in a transformed host, in this case yeast. Our earlier application EP-A-322 094 dis- 
closes N-terminal fragments of HSA, namely those consisting of residues 1-n where n is 369 to 419, which 
have therapeutic utility. The application also mentions the possibility of fusing the C-terminal residue of such 
molecules to other, unnamed, polypeptides. 

One aspect of the present invention provides a fusion polypeptide comprising, as at least part of the N- 

15 terminal portion thereof, an N-terminal portion of HSA or a variant thereof and, as at least part of the C-terminal 
portion thereof, another polypeptide except that, when the said N-terminal portion of HSA is the 1-n portion 
where n is 369 to 419 or a variant thereof then the said polypeptide is (a) the 585 to 1578 portion of human 
f ibronectin or a variant thereof, (b) the 1 to 368 portion of CD4 or a variant thereof, (c) platelet derived growth 
factor, or a variant thereof, (d) transforming growth factor, or a variant thereof, (e) the 1-261 portion of mature 

20 human plasma f ibronectin or a variant thereof, (f) the 278-578 portion of mature human plasma f ibronectin or 
a variant thereof, (g) the 1-272 portion of mature human von Willebrand's Factor or a variant thereof, or (h) 
alpha-1-antitrypsin or a variant thereof. 

The N-terminai portion of HSA is preferably the said 1-n portion, the 1-177 portion (up to and including 
the cysteine), the 1-200 portion (up to but excluding the cysteine) or a portion intermediate 1-177 and 1-200. 

25 The term "human serum albumin" (HSA) is intended to include (but not necessarily to be restricted to) 

known or yet-to-be-discovered polymorphic forms of HSA. For example, albumin Naskapi has Lys-372 in place 
of Glu-372 and pro-albumin Christchurch has an altered pro-sequence. The term "variants" is intended to in- 
clude (but not necessarily to be restricted to) minor artificial variations in sequence (such as molecules lacking 
one or a few residues, having conservative substitutions or minor insertions of residues, or having minor va- 

30 nations of amino acid structure). Thus polypeptides which have 80%, preferably 85%, 90%, 95% or 99%, hom- 
ology with HSA are deemed to be "variants". It is also prefer red for such variants to be physiologically equivalent 
to HSA; that is to say, variants preferably share at least one pharmacological utility with HSA. Furthermore, 
any putative variant which is to be used pharmacologically should be non-immunogenic in the animal (espe- 
cially human) being treated. 

35 Conservative substitutions are those where one or more amino acids are substituted for others having sim- 

ilar properties such that one skilled in the art of polypeptide chemistry would expect at least the secondary 
structure, and preferably the tertiary structure, of the polypeptide to be substantially unchanged. For example, 
typical such substitutions include asparagine for glutamine, serine for asparagine and arginine for lysine. Va- 
riants may alternatively, or as well, lack up to ten (preferably only one or two) intermediate amino acid residues 

40 (ie not at the termini of the said N-terminal portion of HSA) in comparison with the corresponding portion of 
natural HSA; preferably any such omissions occur in the 100 to 369 portion of the molecule (relative to mature 
HSA itself) (if present). Similarly, up to ten, but preferably only one or two, amino acids may be added, again 
in the 100 to 369 portion for preference (if present). The term "physiologically functional equivalents" also en- 
compasses larger molecules comprising the said sequence plus a further sequence at the N-terminal (for ex- 

45 ample, pro-HSA, pre-pro-HSA and met-HSA). 

Clearly, the said "another polypeptide" in the fusion compounds of the invention cannot be the remaining 
portion of HSA, since otherwise the whole polypeptide would be HSA, which would not then be a "fusion poly- 
peptide". 

Even when the HSA-like portion is not the said 1-n portion of HSA, it is preferred for the non-HSA portion 
so to be one of the said (a) to (h) entities. 

The 1 to 368 portion of CD4 represents the first four disulphide-linked immunoglobulin-like domains of the 
human T lymphocyte CD4 protein, the gene for and amino acid sequence of which are disclosed in D. Smith 
et al (1987) Science 328, 1704-1707. It is used to combat HIV infections. 

The sequence of human platelet-derived growth factor (PDGF) is described in Collins et al (1985) Nature 
55 316, 748-750. Similarly, the sequence of transforming growth factors (3 (TGF-P) is described in Derynck et al 
(1985) Nature 316 , 701-705. These growth factors are useful for wound-healing. 

A cDNA sequence for the 1-261 portion of Fn was disclosed in EP-A-207 751 (obtained from plasmid pFH6 
with endonuclease Pyull). This portion binds fibrin and can be used to direct fused compounds to blood clots. 
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A cDNA sequence for the 278-578 portion of Fn, which contains a collagen-binding domain, was disclosed 
by R.J. Owens and RE. Baralle in 1986 E.M.B.O.J. 5, 2825-2830. This portion will bind to platelets. 

The 1-272 portion of von Willebrand's Factor binds and stabilises factor VIII. The sequence is given in Bon- 
tham et al, Nucl. Acids Res. 14, 7125-7127. 
5 Variants of alpha-1 -antitrypsin include those disclosed by Rosenburg et al (1984) Nature 312 , 77-80. In 

particular, the present invention includes the Pittsburgh variant (Met 356 is mutated to Arg) and the variant where 
Pro 357 and Met 358 are mutated to alanine and arginine respectively. These compounds are useful in the treat- 
ment of septic shock and lung disorders. 

Variants of the non-HSA portion of the polypeptides of the invention include variations as discussed above 
10 in relation to the HSA portion, including those with conservative amino acid substitutions, and also homologues 
from other species. 

The fusion polypeptides of the invention may have N-terminal amino acids which extend beyond the por- 
tion corresponding to the N-terminal portion of HSA. For example, if the HSA-like portion corresponds to an 
N-terminal portion of mature HSA, then pre-, pro-, or pre- pro sequences may be added thereto, for example 

15 the yeast alpha-factor leader sequence. The fused leader portions of WO 90/01063 may be used. The poly- 
peptide which is fused to the HSA portion may be a naturally-occurring polypeptide, a fragment thereof or a 
novel polypeptide, including a fusion polypeptide. For example, in Example 3 below, a fragment of fibronectin 
is fused to the HSA portion via a 4 amino acid linker. 

It has been found that the amino terminal portion of the HSA molecule is so structured as to favour par- 

20 ticulariy efficient translocation and export of the fusion compounds of the invention in eukaryotic cells. 

A second aspect of the invention provides a transformed host having a nucleotide sequence so arranged 
as to express a fusion polypeptide as described above. By "so arranged", we mean, for example, that the nu- 
cleotide sequence is in correct reading frame with an appropriate RNA polymerase binding site and translation 
start sequence and is under the control of a suitable promoter. The promoter may be homologous with or het- 

25 erologous to the host. Downstream (3*) regulatory sequences may be included if desired, as is known. The 
host is preferably yeast (for example Saccharomyces spp., e.g. S. cerevisiae ; Kluyveromyces spp., e.g. K. lac- 
tis; Pichia spp.; or Schizosaccharomyces spp., e.g. S. pombe) but may be any other suitable host such as E. 
col i, B. subtilis , Aspergillus spp., mammalian cells, plant cells or insect cells. 

A third aspect of the invention provides a process for preparing a fusion polypeptide according to the first 

30 aspect of the invention by cultivation of a transformed host according to the second aspect of the invention, 
followed by separation of the fusion polypeptide in a useful form. 

A fourth aspect of the invention provides therapeutic methods of treatment of the human or other animal 
body comprising administration of such a fusion polypeptide. 

In the methods of the invention we are particularly concerned to improve the efficiency of secretion of 

35 useful therapeutic human proteins from yeast and have conceived the idea of fusing to amino-terminat portions 
of HSA those proteins which may ordinarily be only inefficiently secreted. One such protein is a potentially 
valuable wound-healing polypeptide representing amino acids 585 to 1578 of human fibronectin (referred to 
herein as Fn 585-1578). As we have described in a separate application (filed simultaneously herewith) this 
molecule contains cell spreading, chemotactic and chemokinetic activities useful in healing wounds. The fusion 

40 polypeptides of the present invention wherein the C-terminal portion is Fn 585-1578 can be used for wound 
healing applications as biosynthesised, especially where the hybrid human protein will be topically applied. 
However, the portion representing amino acids 585 to 1578 of human fibronectin can if desired be recovered 
from the fusion protein by preceding the first amino acid of the fibronectin portion by amino acids comprising 
a factor X cleavage site. After isolation of the fusion protein from culture supernatant, the desired molecule is 

45 released by factor X cleavage and purified by suitable chromatography (e.g. ion-exchange chromatography). 
Other sites providing for enzymatic or chemical cleavage can be provided, either by appropriate juxtaposition 
of the N-terminal and C-terminal portions or by the insertion therebetween of an appropriate linker. 

At least some of the fusion polypeptides of the invention, especially those including the said CD4 and vWF 
fragments, PDGF and o^AT, also have an increased half-life in the blood and therefore have advantages and 

so therapeutic utilities themselves, namely the therapeutic utility of the non-HSA portion of the molecule. In the 
case of a! AT and others, the compound will normally be administered as a one-off dose or only a few doses 
over a short period, rather than over a long period, and therefore the compounds are less likely to cause an 
immune response. 

55 EXAMPLES : SUMMARY 

Standard recombinant DNA procedures were as described by Maniatis etal (1 982 and recent 2nd edition) 
unless otherwise stated. Construction and analysis of phage M13 recombinant clones was as described by 
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Messing (1983) and Sanger et al (1977). 

DNA sequences encoding portions of human serum albumin used in the construction of the following mol- 
ecules are derived from the plasmids mHOB12 and pDBD2 (EP-A-322 094, Delta Biotechnology Ltd, relevant 
portions of which are reproduced below) or by synthesis of oligonucleotides equivalent to parts of this se- 

5 quence. DNA sequences encoding portions of human f ibronectin are derived from the pi asm id pFHDEM , or 
by synthesis of oligonucleotides equivalent to parts of this sequence. Plasmid pFHDELI, which contains the 
complete human cDNA encoding plasma f ibronectin, was obtained by ligation of DNA derived from plasmids 
pFH6, 16, 54, 154 and 1 (EP-A-207 751; Delta Biotechnology Ltd). 

This DNA represents an mRNA variant which does not contain the 'ED' sequence and had an 89-amino 

10 acid variant of the lll-CS region (R.J. Owens, A.R. Kornblihtt and F.E. Baralle (1986) Oxford Surveys on Eu- 
karyotic Genes 3 1 41-160). The map of this vector is disclosed in Fig. 11 and the protein sequence of the mature 
polypeptide produced by expression of this cDNA is shown in Fig. 5. 

Oligonucleotides were synthesised on an Applied Biosystems 380B oligonucleotide synthesiser according 
to the manufacturer's recommendations (Applied Biosystems, Warrington, Cheshire, UK). 

15 An expression vector was constructed in which DNA encoding the HSA secretion signal and mature HSA 
up to and including the 387th amino acid, leucine, fused in frame to DNA encoding a segment of human f ibro- 
nectin representing amino acids 585 to 1578 inclusive, was placed downstream of the hybrid promoter of EP- 
A-258 067 (Delta Biotechnology), which is a highly efficient galactose-inducible promoter functional in Sac- 
charomyces cerevisiae . The codon for the 1578th amino acid of human f ibronectin was directly followed by a 

20 stop codon (TAA) and then the S. cerevisiae phosphoglycerate kinase (PGK) gene transcription terminator. 
This vector was then introduced into S. cerevisiae by transformation, wherein it directed the expression and 
secretion from the cells of a hybrid molecule representing the N-terminal 387 amino acids of HSAC-terminally 
fused to amino acids 585 to 1578 of human f ibronectin. 

In a second example a similar vector is constructed so as to enable secretion by S. cerevisiae of a hybrid 

25 molecule representing the N-terminal 195 amino acids of HSA C- terminally fused to amino acids 585 to 1578 
of human f ibronectin. 

Aspects of the present invention will now be described by way of example and with reference to the ac- 
companying drawings, in which: 

Figure 1 (on two sheets) depicts the amino acid sequence currently thought to be the most representative 
30 of natural HSA, with (boxed) the alternative C-termini of HSA(1-n); 

Figure 2 (on two sheets) depicts the DNAsequence coding for mature HSA, wherein the sequence included 
in Linker 3 is underlined; 

Figure 3 illustrates, diagram matically, the construction of mHOB16; 

Figure 4 illustrates, diagram matically, the construction of pHOB31 ; 
35 Figure 5 (on 6 sheets) illustrates the mature protein sequence encoded by the Fn plasmid pFHDELI; 

Figure 6 illustrates Linker 5, showing the eight constituent oligonucleotides; 

Figure 7 shows schematically the construction of plasmid pDBDF2; 

Figure 8 shows schematically the construction of plasmid pDBDFS; 

Figure 9 shows schematically the construction of plasmid pDBDF9; 
40 Figure 10 shows schematically the construction of plasmid DBDF12, using plasmid pFHDELI; and 

Figure 11 shows a map of plasmid pFHDELI. 

EXAMPLE 1 ; HSA 1-387 FUSED TO Fn 585-1578 

45 The following is an account of a preparation of plasmids comprising sequences encoding a portion of HSA, 
as is disclosed in EP-A-322 094. 

The human serum albumin coding sequence used in the construction of the following molecules is derived 
from the plasmid M13mp19.7 (EP-A-201 239, Delta Biotech- nology Ltd.) or by synthesis of oligonucleotides 
equivalent to parts of this sequence. Oligonucleotides were synthesised using phosphoramidite chemistry on 

so an Applied Biosystems 380B oligonucleotide synthesizer according to the manufacturer's recommendations 
(AB Inc., Warrington, Cheshire, England). 

An oligonucleotide was synthesised (Linker A) which represented a part of the known HSA coding se- 
quence (Figure 2) from the Pstl site (1235-1240, Figure 2) to the codon for valine 381 wherein that codon was 
changed from GTG to GTC: 

55 
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Linker 1 





D 


P 


H 


E 


C 


Y 


5 5' 


GAT 


CCT 


CAT 


GAA 


TGC 


TAT 


3' ACGT 


CTA 


GGA 


GTA 


CTT 


ACG 


ATA 



1247 

10 



20 



A 


K 


V 


F 


D 


E 


F 


K 


GCC 


AAA 


GTG 


TTC 


GAT 


GAA 


TTT 


AAA 


CGG 


TTT 


CAC 
1267 


AAG 


CTA 


CTT 


AAA 


TTT 


P 


L 


V 












CTT 


GTC 


3' 












GGA 


CAG 


5' 













Linker 1 was ligated into the vector M13mp19 (Norrander et al , 1983) which had been digested with Pst l 
and Hindi and the ligation mixture was used to transfect E.coli strain XL1-Blue (Stratagene Cloning Systems, 
San Diego, CA). Recombinant clones were identified by their failure to evolve a blue colour on medium con- 
30 taining the chromogenic indicator X-gal (5-bromo-4-chloro-3-indolyi-p-D-galactoside) in the present of IPTG 
(isopropylthio-p-galactoside). DNA sequence analysis of template DNA prepared from bacteriophage particles 
of recombinant clones identified a molecule with the required DNA sequence, designated mHOB12 (Figure 3). 

M13mp19.7 consists of the coding region of mature HSA in M13mp19 (Norrander et al , 1983) such that 
the codon for the first amino acid of HSA, GAT, overlaps a unique Xho l site thus: 

35 

Asp Ala 

5' CTCGAGATGCA 3' 

40 3' GAGCTCTACGT 5' 



Xho l 



45 (EP-A-210 239). M13mp19.7 was digested with Xho l and made flush-ended by S1 -nuclease treatment and 
was then ligated with the following oligonucleotide (Linker 2): 

Linker 2 

50 

5'TCTTTTATCCAAGCTTGGATAAAAGA 3' 
3'AGAAAATAGGTTCGAACCTATTTTCT 5' 

55 

HinclIII 

The ligation mix was then used to transfect E.coli XL1-Blue and template DNA was prepared from several 
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plaques and then analysed by DNA sequencing to identify a clone, pDBD1 (Figure 4), with the correct se- 
quence. 

A 1.1 kb Hindlll to Pstl fragment representing the 5' end of the HSA coding region and one half of the in- 
serted oligonucleotide linker was isolated from pDBD1 by agarose gel electrophoresis. This fragment was then 

5 ligated with double stranded mHOB12 previously digested with Hindlll and Pstl and the ligation mix was then 
used to transfect E.coli XL1 -Blue. Single stranded template DNA was prepared from mature bacteriophage par- 
ticles of several plaques. The DNA was made double stranded in vitro by extension from annealed sequencing 
primer with the Klenow fragment of DNA polymerase I in the presence of deoxy nucleoside triphosphates. Re- 
striction enzyme analysis of this DNA permitted the identification of a clone with the correct configuration, 

10 mHOB15 (Figure 4). 

The following oligonucleotide (Linker 3) represents from the codon for the 382nd amino acid of mature HSA 
(glutamate, GAA) to the codon for lysine 389 which is followed by a stop codon (TAA) and a Hindlll site and 
then a Bam HI cohesive end: 

15 Linker 3 







E 


E 


P 


Q 


N 


L 


I 


K 


J 






20 


5 ' 


GAA 


GAG 


CCT 


CAG 


AAT 


TTA 


ATC 


AAA 


TAA 


GCTTC 


3' 




3 ' 


CTT 


CTC 


GGA 


GTC 


TTA 


AAT 


TAG 


TTT 


ATT 


CGAACCTAG 


5' 



25 This was ligated into double stranded mHOB15, previously digested with Hindi and Bam HI. After ligation, 

the DNA was digested with Hindi to destroy all non-recombinant molecules and then used to transfect E.coli 
XL1-Blue. Single stranded DNA was prepared from bacteriophage particles of a number of dones and sub- 
jected to DNA sequence analysis. One clone having the correct DNA sequence was designated mHOB1 6 (Fig- 
ure 4). 

30 A molecule in which the mature HSA coding region was fused to the HSA secretion signal was created by 

insertion of Linker 4 into Bam HI and Xho i digested M13mp19.7 to form pDBD2 (Figure 4). 

Linker 4 





M 


K 


W 


V 


S 


F 


5' GATCC 


ATG 


AAG 


TGG 


GTA 


AGC 


TTT 


G 


TAC 


TTC 


ACC 


CAT 


TCG 


AAA 





S 


L 


L 


F 


L 


F 


S 


ATT 


TCC 


CTT 


CTT 


TTT 


CTC 


TTT 


AGC 


TAA 


AGG 


GAA 


GAA 


AAA 


GAG 


AAA 


TCG 



50 
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10 



15 



s 


A 


Y 


S 


R 


G 


V 


F 


TCG 


GCT 


TAT 


TCC 


AGG 


GGT 


GTG 


TTT 


AGC 


CGA 


ATA 


AGG 


TCC 


CCA 


CAC 


AAA 



R R 
CG 3' 
GCAGCT 5 ' 



In this linker the codon for the fourth amino acid after the initial methionine, ACC for threonine in the HSA 
pre-pro leader sequence (Lawn etal, 1981), has been changed to AGC for serine to create a Hindlll site. 

A sequence of synthetic DNA representing a part of the known HSA coding sequence (Lawn etal., 1981) 
(amino acids 382 to 387, Fig. 2), fused to part of the known fibronectin coding sequence (Kornblihtt et al ., 1985) 

20 (amino acids 585 to 640, Fig. 2), was prepared by synthesising six oligonucleotides (Linker 5, Fig. 6). The oli- 
gonucleotides 2, 3, 4, 6, 7 and 8 were phosphorylated using T4 polynucleotide kinase and then the oligonu- 
cleotides were annealed under standard conditions in pairs, i.e. 1+8, 2+7, 3+6 and 4+5. The annealed oligo- 
nucleotides were then mixed together and ligated with mHOB12 which had previously been digested with the 
restriction enzymes Hindi and EcoRI. The ligation mixture was then used to transfect E.coli XL1-Blue (Stra- 

25 tagene Cloning Systems, San Diego, CA). Single stranded template DNA was then prepared from mature bac- 
teriophage particles derived from several independent plaques and then was analysed by DNA sequencing. 
A clone in which a linker of the expected sequence had been correctly inserted into the vector was designated 
pDBDFI (Fig. 7). This plasmid was then digested with Pstl and Eco RI and the approx. 0.24kb fragment was 
purified and then ligated with the 1.29kb Bam HI-Pstl fragment of pDBD2 (Fig. 7) and Bam HI + EcoRI digested 

30 pUC19 (Yanisch-Perron, et al ., 1985) to form pDBDF2 (Fig. 7). 

A plasmid containing a DNA sequence encoding full length human fibronectin, pFHDELI, was digested 
with Eco RI and Xho l and a 0.77kb EcoRI-xhol fragment (Fig. 8) was isolated and then ligated with Eco RI and 
sai l digested M13 mp18 (Norrander et al ., 1983) to form pDBDF3 (Fig. 8). 

The following oligonucleotide linker (Linker 6) was synthesised, representing from the Pstl site at 4784- 

35 4791 of the fibronectin sequence of EP-A-207 751 to the codon for tyrosine 1578 (Fig. 5) which is followed by 
a stop codon (TAA), a Hindlll site and then a Bam HI cohesive end: 

Linker 6 

40 



45 



50 



G 


P 


D 


Q 


T 


E 


M 


T 


I 


E 


G 


L 


GGT 


CCA 


GAT 


CAA 


ACA 


GAA 


ATG 


ACT 


ATT 


GAA 


GGC 


TTG 


A CGT CCA 


GGT 


CTA 


GTT 


TGT 


CTT 


TAC 


TGA 


TAA 


CTT 


CCG 


AAC 



Q 


P 


T 


V 


E 


y 


Stop 


CAG 


ccc 


ACA 


GTG 


GAG 


TAT 


TAA 


GTC 


GGG 


TGT 


CAC 


CTC 


ATA 


ATT 



GCTTG 
CGAACCTAG 



55 This linker was then ligated with Pstl and Hindlll digested pDBDF3 to form pDBDF4 (Fig. 8). The following 

DNA fragments were then ligated together with Bglll digested pKV50 (EP-A-258 067) as shown in Fig. 8: 0.68kb 
EcoRI- Bam HI fragment of pDBDF4, 1 .5kb Bam HI-StuI fragment of pDBDF2 and the 2.2kb Stul-EcoRI fragment 
of pFHDELI. The resultant plasmid pDBDF5 (Fig. 8) includes the promoter of EP-A-258 067 to direct the ex- 
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pression of the HSA secretion signal fused to DNA encoding amino acids 1-387 of mature HSA, in turn fused 
directly and in frame with DNA encoding amino acids 585-1578 of human fibronectin, after which translation 
would terminate at the stop codon TAA. This is then followed by the S.cerevisiae PGK gene transcription ter- 
minator. The plasmid also contains sequences which permit selection and maintenance in Escherichia coli and 
5 S.cerevisiae (EP-A-258 067). 

This plasmid was introduced into S.cerevisiae S150-2B (leu2-3 I eu2-112 ura3-52 trp1-289 his3- 1) by stan- 
dard procedures (Beggs, 1978). Transformants were subsequently analysed and found to produce the HSA- 
fibronectin fusion protein. 

10 EXAMPLE 2 : HSA 1-195 FUSED TO Fn 585-1578 

In this second example the first domain of human serum albumin (amino acids 1-195) is fused to amino 
acids 585-1578 of human fibronectin. 

The plasmid pDBD2 was digested with Bam HI and BgHI and the 0.79kb fragment was purified and then 
15 ligated with BamHI-digested M13mp19 to form pDBDF6 (Fig. 6). The following oligonucleotide: 

5 ' -C CAAAGCTCGAGGAACTT C G-3' 

was used as a mutagenic primer to create a Xhol site in pDBDF6 by in vitro mutagenesis using a kit supplied 
20 by Amersham International PLC. This site was created by changing base number 696 of HSA from a T to a G 
(Fig. 2). The plasmid thus formed was designated pDBDF7 (Fig. 9). The following linker was then synthesised 
to represent from this newly created Xho l site to the codon for lysine 195 of HSA (AAA) and then from the 
codon for isoleucine 585 of fibronectin to the ends of oligonucleotides 1 and 8 shown in Fig. 6. 

25 Linker 7 



30 


D 

TC GAT 
A 


E 
GAA 
CTT 


L 
CTT 
GAA 


R 
CGG 
GCC 


D 
GAT 
CTA 


E 
GAA 
CTT 


G 
GGG 
CCC 


K 
AAG 
TTC 


A 
GCT 
CGA 


S 
TCG 
AGC 


S 
TCT 
AGA 


A 
GCC 
CGG 


K 
AAA 
TTT 


35 
40 


I 

ATC 
TAG 


T 
ACT 
TGA 


E 
GAG 
CTC 


T 
ACT 
TGA 


P 
CCG 
GGC 


S 
AGT 
TCA 


Q 
CAG 

GTC 


P 

C 

GGG 


N 
TTG 


S 
AGG 


H 

GTG 


G 





This linker was ligated with the annealed oligonucleotides shown in Fig. 3, i.e. 2+7, 3-1-6 and 4+5 together 
with Xho l and EcoRI digested pDBDF7 to form pDBDF8 (Fig. 9). Note that in order to recreate the original 
HSA DNA sequence, and hence amino acid sequence, insertion of linker 7 and the other oligonucleotides into 

45 pDBDF7 does not recreate the Xhol site. 

The 0.83kb Bam Hi-StuI fragment of pDBDF8 was purified and then was ligated with the 0.68kb EcoRI- 
Bam HI fragment of pDBDF2 and the 2.22kb Stul-EcoRI fragment of pFHDELI into Bg Ill-digested pKV50 to 
form pDBDF9 (Fig. 9). This plasmid is similar to pDBDF5 except that it specifies only residues 1-195 of HSA 
rather than 1-387 as in pDBDFS. 

so When introduced into S.cerevisiae S150-2B as above, the plasmid directed the expression and secretion 
of a hybrid molecule composed of residues 1-195 of HSA fused to residues 585-1578 of fibronectin. 

EXAMPLE 3 : HSA 1-387 FUSED TO Fn 585-1578, AS CLEAVABLE MOLECULE 

55 In order to facilitate production of large a mounts of residues 585- 1578 of fibronectin, a construct was made 

in which DNA encoding residues 1-387 of HSA was separated from DNA encoding residues 585-1578 of fibro- 
nectin by the sequence 
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I E G R 
ATT GAA GGT AGA 

5 

TAA CTT CCA TCT 

which specifies the cleavage recognition site for the blood clotting Factor X. Consequently the purified secreted 
product can be treated with Factor X and then the fibronectin part of the molecule can be separated from the 
10 HSA part. 

To do this two oligonucleotides were synthesised and then annealed to form Linker 8. 
Linker 8 



15 


E 


E 


P 


Q 


N 


L 


I 


E 


G 




GAA 


GAG 


CCT 


CAG 


AAT 


TTA 


ATT 


GAA 


GGT 




CTT 


CTC 


GGA 


GTC 


TTA 


AAT 


TAA 


CTT 


CCA 


20 






















R 


I 


T 


E 


T 


P 


S 


Q 


P 


25 


AGA 


ATC 


ACT 


GAG 


ACT 


CCG 


AGT 


CAG 


C 




TCT 


TAG 


TGA 


CTC 


TGA 


GGC 


TCA 


GTC 


GGG 


30 






















N 


S 


H 














35 


TTG 


AGG 


GTG 


G 













This linker was then ligated with the annealed oligonucleotides shown in Fig. 6, i.e. 2+7 , 3+6 and 4+5 into 
Hindi and Eco RI digested mHOB12, to form pDBDF10 (Fig. 7). The plasmid was then digested with Pstl and 
Eco Rl and the roughly 0.24kb fragment was purified and then ligated with the 1 .29kb BamHI-Pstl fragment of 
40 pDBD2 and Bam HI and Eco Rl digested pUC19 to form pDBDF11 (Fig. 10). 

The 1.5kb BamHI-StuI fragment of pDBDF11 was then ligated with the 0.68kb EcoRI- Bam H1 fragment of 
pDBDF4 and the 2.22kb Stul-EcoRI fragment of pFHDELI into Bqlll-digested pKV50 to form pDBDF12 (Fig. 
10). This plasmid was then introduced into S.cerevisiae S150-2B. The purified secreted fusion protein was 
treated with Factor X to liberate the fibronectin fragment representing residues 585-1578 of the native mole- 
45 cule. 
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Claims 

Claims for the following Contracting States : AT, BE, CH, LI, DE, DK, FR, IT, LU, NL, SE 

5 1. A fusion polypeptide comprising, as at least part of the N-terminal portion thereof, an N-terminal portion 
of HSAor a variant thereof and, as at least part of the C-terminal portion thereof, another polypeptide 
except that, when the said N-terminal portion of HSA is the 1-n portion where n is 369 to 419 or a variant 
thereof then the said polypeptide is (a) the 585 to 1578 portion of human fibronectin or a variant thereof, 
(b) the 1 to 368 portion of CD4 or a variant thereof, (c) platelet derived growth factor or a variant thereof, 

10 (d) transforming growth factor p or a variant thereof, (e) the 1-261 portion of mature human plasma fi- 

bronectin or a variant thereof, (0 the 278-578 portion of mature human plasma fibronectin or a variant 
thereof, (g) the 1-272 portion of mature human von Willebrand's Factor or a variant thereof, or (h) alpha- 
1 -antitrypsin or a variant thereof. 

15 2. A fusion polypeptide according to Claim 1 additionally comprising at least one N-terminal amino acid ex- 
tending beyond the portion corresponding to the N-terminal portion of HSA. 

3. A fusion polypeptide according to Claim 1 or 2 wherein there is a cleavable region at the junction of the 
said N-terminal or C-terminal portions. 

20 

4. A fusion polypeptide according to any one of the preceding claims wherein the said C-terminal portion is 
the 585 to 1578 portion of human plasma fibronectin or a variant thereof. 

5. A transformed or transfected host having a nucleotide sequence so arranged as to express a fusion poly- 
peptide according to any one of the preceding claims. 

25 

6. A process for preparing a fusion polypeptide by cultivation of a host according to Claim 5, followed by sep- 
aration of the fusion polypeptide in a useful form. 

7. A fusion polypeptide according to any one of Claims 1 to 4 for use in therapy. 

30 

Claims for the following Contracting States : ES, GR 

1. A process for preparing a fusion polypeptide by (i) cultivation of a transformed or transfected host having 
a nucleotide sequence so arranged as to express a fusion polypeptide, followed by (ii) separation of the 

35 fusion polypeptide in a useful form, characterised in that the fusion polypeptide comprises as at least part 

of the N-terminal portion thereof, an N-terminal portion of HSA or a variant thereof and, as at least part 
of the C-terminal portion thereof, another polypeptide except that, when the said N-terminal portion of 
HSA is the 1-n portion where n is 369 to 41 9 or a variant thereof then the said polypeptide is (a) the 585 
to 1 578 portion of human fibronectin or a variant thereof , (b) the 1 to 368 portion of CD4 or a variant there- 

40 of, (c) platelet derived growth factor or a variant t hereof, (d) transforming growth factor p or a variant there- 

of, (e) the 1-261 portion of mature human plasma fibronectin or a variant thereof, (f) the 278-578 portion 
of mature human plasma fibronectin or a variant thereof, (g) the 1 -272 portion of mature human von Will- 
ebrand's Factor or a variant thereof, or (h) alpha-1 -antitrypsin or a variant thereof. 

45 2. A process according to Claim 1, wherein the fusion polypeptide additionally comprising at least one N- 
terminal amino acid extending beyond the portion corresponding to the N-terminal portion of HSA 

3. A process according to Claim 1 or 2 wherein, in the fusion polypeptide, there is a cleavable region at the 
junction of the said N-terminal or C-termina! portions. 

50 

4. A process according to any one of the preceding claims wherein the said C-terminal portion is the 585 
to 1578 portion of human plasma fibronectin or a variant thereof. 



55 
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Patentanspruche 

Patentanspruche fur folgende Vertragsstaaten : AT, BE, CH, DE, DK, FR, IT, LU, NL, SE 

5 1. Fusionspolypeptid, umfassend als mindestens einen Tei! seines N-terminalen Teils einen N-terminalen 
Teil von HSAoder eine Variante davon und als mindestens einen Teil seines C-terminalen Teils ein wei- 
teres Polypeptid mit der Ausnahme, daft wenn es sich bei dem N-terminalen Teil von HSA urn den Teil 1- 
n mit n = 369 bis 419 oder eine Variante davon handelt, das Polypeptid aus 
(a) dem Teil 585 bis 1578 von Humanfibronectin oder einer Variante davon, 

10 (b) dem Teil 1 bis 368 von CD4 oder einer Variante davon, 

(c) dem "Platelet Derived Growth Factor" (PDGF) oder einer Variante davon, 

(d) dem Transforming Growth Factor p w (TGF P) oder einer Variante davon, 

(e) dem Teil 1-261 von reifem Humanplasmafibronectin oder einer Variante davon, 

(f) dem Teil 278-578 von reifem Humanplasmafibronectin oder einer Variante davon, 

is (g) dem Teil 1-272 von reifem Human-von Willebrand's-Faktor oder einer Variante davon oder 

(h) Alpha-1 -Antitrypsin oder einer Variante davon, besteht 

2. Fusionspolypeptid nach Anspruch 1 , zusatzlich umfassend mindestens eine N-terminale Aminosaure, die 
langer als der dem N-terminalen Teil von HSA entsprechende Teil ist 

20 

3. Fusionspolypeptid nach Anspruch 1 oder 2, bei dem sich an der Verbindung der N-terminalen oder C- 
terminalen Teile eine spaltbare Region befindet. 

4. Fusionspolypeptid nach einem der vorhergehenden Anspruche, wobei der C-terminale Teil aus dem Teil 
585 bis 1578 von Humanplasmafibronectin oder einer Variante davon besteht. 

5. Transformierter oder transf izierter Wirt mit einer Nukleotidsequenz, die so angeordnet ist, daB sie ein Fu- 
sionspolypeptid nach einem dervorhergehenden Anspruche exprimieren kann. 

6. Verfahren zur Herstellung eines Fusionspolypeptids durch Kultivieren eines Wirts nach Anspruch 5 und 
anschlie&endes Abtrennen des Fusionspolypeptids in einer geeigneten Form. 

7. Fusionspolypeptid nach einem der Anspruche 1 bis 4 zur therapeutischen Verwendung. 
Patentanspruche fur folgende Vertragsstaaten : ES, GR 

1. Verfahren zur Herstellung eines Fusionspolypeptids durch 

(i) Kultivieren eines transformierten oder transfektierten Wirts mit einer Nukleotidsequenz, die so an- 
geordnet ist, dad sie ein Fusionspolypeptid exprimiert, und 
(ii) anschlieBendes Abtrennen des Fusionspolypeptids in einer geeigneten Form, 

dadurch gekennzeichnet, da& das Fusionspolypeptid als mindestens einen Teil seines N-terminalen Teils 
einen N-terminalen Teil von HSAoder eine Variante davon und als mindestens einen Teil seines C-ter- 
minalen Teils ein weiteres Polypeptid umfaBt, mit der Ausnahme, dali wenn es sich bei dem N-terminalen 
Teil von HSA urn den Teil 1-n mit n= 369 bis 41 9 oder eine Variante davon handelt, das Polypeptid aus 

(a) dem Teil 585-1578 von Humanfibronectin oder einer Variante davon, 

(b) dem Teil 1-368 von CD4 oder einer Variante davon, 

(c) dem Platelet Derived Growth Factor oder einer Variante davon, 

(d) dem Transforming Growth Factor p oder einer Variante davon, 

(e) dem Teil 1-261 von reifem Humanplasmafibronectin oder einer Variante davon, 
(0 dem Teil 278-578 von reifem Humanplasmafibronectin oder einer Variante davon, 

(g) dem Teil 1-272 von reifem Human-von Willebrand's-Faktor oder einer Variante davon oder 

(h) a-1 -Antitrypsin oder einer Variante davon besteht. 

2. Verfahren nach Anspruch 1, wobei das Fusionspolypeptid zusatzlich mindestens eine N-terminale Ami- 
nosaure, die langer als der dem N-terminalen Teil von HSA entsprechende Teil ist, umfa&t 

55 

3. Verfahren nach Anspruch 1 oder 2, wobei sich in dem Fusionspolypeptid an der Verbindung der N-termi- 
nalen oder C-terminalen Teile eine spaltbare Region befindet. 



40 



45 
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4. Verfahren nach einem dervorhergehenden Anspruche, wobei der C-terminale Teil aus dem Teil 585-1578 
von Humanplasmafibronectin oder einer Variante davon besteht. 



5 Revendications 

Revendications pour les Etats contractants suivants : AT, BE, CH, DE f DK, FR f IT, LU, NL, SE 

1. Polypeptide fusionne comprenant en tant qu'au moins une partie de sa portion N-terminale, une portion 
w N-terminale de HSA ou d'un variant de celle-ci et, en tant qu'au moins une partie de sa portion C-termi- 

nale, un autre polypeptide sauf que, lorsque cette portion N-terminale de HSA est la portion 1-n dans la- 
quelle n est 369 a 419 ou un variant de celle-ci, ce polypeptide est (a) la portion 585 a 1578 de la fibro- 
nectine humaine ou un variant de celle-ci, (b) la portion 1 a 368 de CD4 ou un variant de celle-ci, (c) le 
facteur de croissance derive des plaquettes sanguines ou un variant de celui-ci, (d) le facteur de crois- 
15 sance p de transformation ou un variant de celui-ci, (e) la portion 1-261 de la f ibronectine mature de plas- 

ma humain ou un variant de celle-ci, (f) la portion 278-578 de la fibronectine mature de plasma humain 
ou un variant de celle-ci, (g) la portion 1-272 du facteur humain mature de von Wille brand ou un variant 
de celle-ci, ou (h) l'alpha-1 -antitrypsin e ou un variant de celle-ci. 

20 2. Polypeptide fusionne suivant la revendication 1 . comprenant de plus au moins un acide amine N-terminal 
se prolongeant au-dela de la portion correspondant a la portion N-terminale de HSA. 

3. Polypeptide fusionne suivant les revendications 1 ou 2, dans lequel tl y a une region susceptible d'etre 
coupee a la jonction de ces portions N-terminale et C-terminale. 

25 

4. Polypeptide fusionne suivant Tune quelconque des revendications precedentes, dans lequel cette portion 
C-terminale est la portion 585 a 1578 de la fibronectine de plasma humain ou un variant de celle-ci. 

5. Hdte transforms ou transfecte ayant une sequence de nucleotides arrangee de facon a exprimer un po- 
lypeptide fusionne suivant Tune quelconque des revendications precedentes. 

30 

6. Procede pour preparer un polypeptide fusionne par culture d'un hdte suivant la revendication 5, survie de 
la separation du polypeptide fusionne sous une forme utile. 

7. Polypeptide fusionne suivant Tune quelconque des revendications 1 a 4 utilisable en therapie. 

35 

Revendications pour les Etats contractants suivants : ES, GR 

1. Procede pour preparer un polypeptide fusionne par (i) la culture d'un hfite transforme ou transfecte ayant 
une sequence de nucleotides arrangee de facon a exprimer un polypeptide fusionne, suivie de (ii) la se- 

40 par at ion du polypeptide fusionne sous une forme utilie, caracterise en ce que le polypeptide fusionne 

comprend, en tant qu'au moins une partie de sa portion N-terminale, une portion N-terminale de HSAou 
d'un variant de celle-ci et, en tant qu'au moins une partie de sa portion C-terminale, un autre polypeptide 
sauf que, lorsque cette portion N-terminale de HSA est la portion 1-n dans laquelle n est 369 a 419 ou 
un variant de celle-ci, ce polypeptide est alors (a) la portion 585 a 1578 de la fibronectine humaine ou un 

45 variant de celle-ci, (b) la portion 1 a 368 de CD4 ou un variant de celle-ci, (c) le facteur de croissance 

derive des plaquettes sanguines ou un variant de celui-ci, (d) le facteur de croissance p de transformation 
ou un variant de celui-ci, (e) la portion 1-261 de la fibronectine mature de plasma humain ou un variant 
de celle-ci, (f) la portion 278-578 de la fibronectine mature de plasma humain ou un variant de celle-ci, 
(g) la portion 1-272 du facteur humain mature de von Willebrand ou un variant de celle-ci, ou (h) I'alpha- 

50 1-antitrypsine ou un variant de celle-ci. 

2. Procede suivant la revendication 1, dans lequel le polypeptide fusionne comprend de plus au moins un 
acide amine N-terminal se prolongeant au-dela de la portion correspondant a la portion N-terminale de 
HSA, 

55 

3. Procede suivant les revendications 1 ou 2 dans lequel, dans le polypeptide fusionne, it y a une region sus- 
ceptible d'etre coupee a la jonction de ces portions N-terminale et C-terminale. 
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4. Proc6d6 suivant Tune quelconque des revendications prec6dentes, dans lequel cette portion C-terminale 
est la portion 585 a 1578 de la fibronectine de plasma humain ou un variant de celle-ci. 
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FIGURE I 

10 20 
Asp Aia His Lys Ser Glu Vai Ala His Axg Phe lys Asp Lau Gly Glu Giu Asn ?he Lys 

30 40 
Aia Leu vai Leu lie Ala Phe Aia Gir. Tyr Leu Gin Gin Cys Pro Phe Glu Asp His vai 

50 50 
Lys Leu Val Asn Glu Vai Thr Glu Phe Ala Lys Thr Cys 7a 1 Ala Asp Giu Ser Ala Glu 

70 30 
Asn Cys Asp Cys Ser uau His Thr Lau Phe Gly Asp Lys Lau Cys Thr Val Ala Thr Lau 

90 1 00 

Arg Glu Thr Tyr Gly Giu Met Aia Asp Cys Cys Ala lys Gin Glu ?ro Glu Arg Asn Glu 

110 120 
Cys Phe Lau Gin His Lys Asp Asp Asn Pro Asa Leu Pro Arg Leu Val .Arg Pro Glu Val 

■ 130 140 
Asp Val Met Cys Thr Aia Phe His Asp Asn Glu Giu Thr ?he Lau Lys Lys Tyr Lau Tyr 

150 ISO 
Glu lie Ala Arg Axg His Pro Tyr Phe Tyr Ala Pro Glu Leu Leu Phe Phe Ala Lys Arg 

170 t ao 

Tyr Lys Ala Ala Phe Thr Glu Cys Cys Gin Ala Ala Asp Lys Ala Aia Cys Leu Lau Pro 

190 200 
Lys Leu Asp Glu Leu Arg Asp Glu Gly Lys Ala Ser Ser Ala Lys Gin Arg Leu Lys Cys. 

210 220 
Aia Ser Leu Gin. Lys Phe Gly Glu .Arc Ala Phe Lys Ala Trp Ala Val Aia Arg Leu Ser 

230 240 
Gin Arg Phe Pro Lys Ala Giu Phe Ala Glu Val Ser Lys Lau Val Thr Asp Leu Thr Lys 

250 250 
Vai His Thr Glu Cys Cys His Gly Asp Lau Lau Glu Cys Ala Asp Asp Arg Ala Asp Lau 

270 280 
Aia Lys Tyr lie Cys Giu Asn Gin Asp Ser He Ser Ser Lys Lau Lys Giu Cys Cys Giu 

290 300 
Lys Pro Lau Lau Glu Lys Ser His Cys II 9 Ala Glu Val Glu Asn Asp Giu Met Pro Ala 

31 0 320 
Asp Lau Pro Ser Lau Ala Aia Asp Phe 7a i Glu Ser Lys Asp Val Cys Lys Asn Tyr Ala 

330 340 
Glu Ala Lys Asp vai Phe Lau Gly Met Phe Lau Tyr Giu Tyr Ala Arg .Arg .-lis Pro Asp 

350' 350 
Tyr Ser Vai Vai Lau Leu Leu Arg Leu Ala Lys Thr Tyr Giu Thr Thr Lau Glu Lys Cys 



| 570 330 



Cys Ala Ala Aia As? Pro Sis Ciu jCys Tyr Ala Lys Vai Phe Asp Giu Phe Lys Pro Leu 
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TZG^ZZ 1 Cert 



~ " — — ^oT 

m Glu ciu Pro Gin Asn Leu. He Lys Gin Asn Cys Glu Lau Phe Glu Gin leu civ Glu 



4 I 0 ■ 

:v- Lys Phe "Gin Asn Ala Leu Lau Val Arg Tyr Thr Lys Lys Val Pro Gin Val 5, 



420 
Thr 



Pro Thr Lac Val Glu Val Ser Arg Asn Lau Gly Lys . </al Gly Ser Lys Cys Cys Lys HI 

450 4fifl 
?ro Glu Ala Lys Arg v et Pro Cys Ala Glu Asp Tyr Lau Ser Val Val Leu Asn Gin Lau 



470 

Asp Arg Val Thr Lys Cys Cys Thr Glu Ser 



Cys val Lau His Glu Lys Thr Pro Val Ser A: 480 



Glu Phe Asn Ala Glu Thr. Phe Thr Phe Sis' Ala Asp He Cys Thr Lau Ser Glu Lys ^ 

530 . , n 

Arg Gin lie Lys Lys Gin Thr Ala Leu Val Glu L 



490 - fl0 
leu val Asn Arg Arg Pro Cys Phe Sar Ala Leu Glu Val Asp Glu Thr Tyr val Pro L-/s 

510 

Phe Sis Ala Asp lis Cys Thr Lau Ser Glu Lys Glu 
30 

eu Val Lys His Lys Pro Lys Ala Thr 

uys Glu Gin Lau Lys Ala Val Met Asp Asp Phe Ala Ala Phe Val Glu Lys Cys Cys Lys 

570 „ 
Ala Asp Asp Lys Glu Thr Cys Phe Ala Glu Ciu Gly Ly S Lys Leu Val. Ala Ala Sar GU 

Ala Ala Leu Giv Leu 
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FIGURE 2 DN'A sequence coding for ma cure HSA 



10 20 3C 40 50 60 70 80 

GATGCACACAAGAGTGAGGTTGCTCATCGGTTT^J^GATTTGGCAGA.^CAAAATrTC^AAGCCrTCGTCrTGATTCCCT': 
0 A H X 5 Z V A H ?. 7 X 0 L G Z Z N 7 K A L. V L I A ? 

50 100 110 120 130 140 150 i SO 

TGCTCAGTArCTTCAGCAGTGTCCATTTGAAGATCATGTAAAATTAGTGAATGAAGTAACrGA.ArTTCCAAAAACATGTG 

AQyLQQC?ZZDHVXLV.NZVTZ7AXTC 

-.70 180 190 200 210 220 230 240 

rrGCTGATGAGTCAGCTGAAAATTGTGAO-AATCACTTCATACCCrTTTTGGAGACAAATTATGCACAGTTGCAACTCTT 

V A- D Z 3 A Z N C D X S - H T L 7 G 0 X L C T V A T L 

250 260 270 280 290 300 310 320 

CGTGAAACCTATGGTGAAATGGCTGACTGCTCTGCAAAACAA 
R Z T ? G Z M A 0 C C A X Q £ P Z R N I C ~ L Q H X Q 

330 340 350 360 370 380 390 400 

TGACAACCCAAACCTCCCCCGATTGGTGAGACCAGAGGTTGATGTGATGTGCACTGCTTTTCATGACAATGAAGAGACAT 
0 N ? N L ?RLVR?ZVDVMCTA F H D N E Z ? 

410 420 430 440 450 ' 460 470 480 

TTTTGAAAAAATACTTATATGAAATTCCCAGAAGACATCCTTAC^^ 

? l :< x y l y z i a r r a ? y 7 y a p z l l ? ? a x r 

490 500 510 520 530 540 550 560 

TATAA-AGCTGCTTTTACAGAATGTTGCCAAGCTGCTGATAAAG^TGCCTGCCTGTTGCCAAAGCTCGATGAACTTCGGGA 

Y XAA? T ZCCQAADXAACL1 ? XLDZL RD 

570 530 590 600 610 620 630 640 

TGAAGGGAAGGCTTCGTCTGCCAAACAGAGACTGAAATGTGCCAGTCT^ 

Z G X A S S A X Q R L X C A S L Q X F G I R A F X A 

650 650 570 . 680 590 700 710 720 

GGGCAGTGC^TTCGCCTGAGCC^GAGATTTCCCAAAGCTGAGTTTGCAGAAGTTTCCAAGTTAGTGACAaATCTTACC^^ 
W A V A R L S Q R 7 ? X A Z F A Z V S X L V T D L T K 

730 ' 740 750 760 770 780 790 300 

GTCCACACGGAATGCrGCCATGGAGATCTGCTTGAATGTGCTGATGACAGGGCGGACCTTGCCAACrATATCTGTGAAAA. 
V uj T Z C C HGD1LZCAD D R A D L A X Y 1 C • Z N 

310 320 830 840 850 860 370 380 

TCAGGATTCGATCTCCAGTAAACTGAAGGAATGCTGTGAAAAACCTCTGTTGGAA^AATCCCACTGCATTGCCGAAGTGG 
QDSISSXLXZCCZXPLLZXSnClAZV 

£90 900 910 920 .930 940 950 960 

AAAATGATGAGATGCCTGCTGACTTGCCTTCATTAGCTGCTGATTTTGTTGAA^GTAAGGATGTTTGCAAAAACTATGCT 
ZNDZM? ADLPSLAADFVZS X D V C X N y A 

970 980 990 1000 1010 1020 1030 1040 

GAGGCAAAGGATGTCTTCCTGGGCATGTTTTTGTATGAATATGC^AGAACGCATCCTGATTACTCTGTCGTGCTGCTGCT 
ZAXDV F LG.MFLYZYARRH? D Y 5 V V L L L 
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rlGURZ 2 Conn . 

105C 1 060 '.070 t080 1090 1100 UJ-0 1120 

GAGACTTGCCAAGACATATGAAACCACrCTACAGAAGrGCTGTGCCKrGCAGATCCTCATG^TGCrATCCCAAACTGT 
?. L A X T 'i Z T T 1 I X C C A A A D ? H £ C V A X V 

1130 1140 1150 1160 1170 1180 1190 1200 

TC C AT G AATTTAAACCrCTTCTGGAJ^G AG CCTC AG 

7 D I T X ? L V I ? Q N "XQNCiLrZQLGl 

1210 1220 1230 1240 1250 1250 1270 1280 

TACAAATTCCAGAATGCGCTATTACTTCCTTACACCAAGAAAGTACCCCAAGTGTCAACTCC^ACTCrTGTAGAGGTCTC 

Y X 7 Q N A LLVHYTK KVP.QV S T ?TL V Z V 5 

1290 1 300 1 3 1 C 1320 1330 1340 1350 1350 

AACAAACC7ACGAAAAGTGGGCAC£AAATCTTGTAAACATCC^ 

?. N 1 G X V G S X C C X H ? Z A X R M P C A Z D Y L 

1370 1330 1390 1400 1410 1420 1430 1440 

CCGTGGTCCTGAACCAGTTATGTGTGTTGCATGAGAAAACGCCAGTAAGTGACAGAGTC^CAAAATGCTGCACAGAGTCC 
5 v V L tN Q L C V L H I K T ? v S D R V T X C C 7 £ S 

1450 1450 1470 1480 1490 1500 1510 1520 

TTGGTGAACAG<;CGACCATGCTTTTCAGCTCTGGAAG7CGATGAAAC^^ 
I.VNrlR?CfSALEVDET'!rv?KH?NAETJ 

1530 1540 1550 1560 1570 1580 1590 1600 

CACCT^CCATGCAGATATATGCACAC^TTCTGAGAAGGAGAGACAAATCAAGAAACAAACTGCACTTGTTGAGCTTGTGA 
77HAD ICTLSZXZRQIXKQTAL.VZ1V 

16J0 1620 1630 1540 1650 1660 :670 1530 

AACACAAGCCCAAGGCAACAAAAGAGCAACTGAAAGCTGTTATGGATGATTTCGCAGC7T7TGTAGAGAAGTGCTGCAAG 
K H X ? X A 7 X Z Q L X A V M D D 7 A A 7 V Z X C C X 

1690 1700 1710 1720 1730 1740 1750 1760 

GCTGACGATAAGGAGACCTCCTTTGCCGAGGAGGGTAAAAAACTrGTTGCTGCAAGTCAAGCTGCCTTAGGCTTATAACA 
A D DX Z T C 7 A ZI'G X KL VA A S Q AAL G L 

1770 1780 
TCTACATTTAAAAGCATCrCAG 
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GAAGAGCCTCAGAATTTAATCACTGAGACTCCGAGTCAGCCCAACTCCCACCCCATCCAGTGG 
^ CTTCTCGGAGTCTTAAATTAGTGACTCTGAGGCTCAGTCGGGTTGAGGGTGGGGTAGGTCACC 



eepqnlite 



tpsgpnshpiq 



AATGCAC CACAG C CATCT CACATTTC CAAG TACATTCTCA.GGTGGAGAC C TAAAAATT C TGTA 
TTACGTGGTGTCGGTAGAGTGTAAAGGTTCATGTAAGAGTCCACCTCTGGATTTTTAAGACAT 



n a p q p 
7 



s hi s. k y i 1 rwr p k n s v 



GGC C GTTGGAAGGAAGC TAC CATAC CAGGC CACTTAAACTC CTACAC CAT CAAAGGC C TG 
CCGGCAACCTTCCTTCGATOTTATGGTCCGGTGAATTTGAGGATCTGGTAGTTTCCGGACTTAA 



rvkeatipghlns 



y t i k g 1 



Figure S Linker 5 showing the eight constituent oligonucleotides 
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Fig. 7 Construction of pDBDF2 
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E 



Fig. 8 Construction of pDBDF5 
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• 9 Construction of pDBDF9 
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St 
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Figure LI 

Name; pFHDELl 

Vector: pUC\S Amp** 2360bp 

Insert: hFNcDNA - 7630bp 
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